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Abstract A linear chord diagram canonically determines a fatgraph and 
hence has an associated genus <?. We compute the natural generating func- 
tion C g (z) = X^n>o c s( n )- z ™ f° r the number c g (n) of linear chord diagrams of 
fixed genus g > 1 with a given number n > of chords and find the remarkably 
simple formula C g (z) = z 2g R g (z)(l — 4z)5 _3 9 ; where R g {z) is a polynomial 
of degree at most g — 1 with integral coefficients satisfying R g {\) ^ and 
R g (0) = Cg(2g) 7^ 0. In particular, C g (z) is algebraic over C(z), which gen- 
eralizes the corresponding classical fact for the generating function Cq(z) of 
the Catalan numbers. As a corollary, we also calculate a related generating 
function germaine to the enumeration of knotted RNA secondary structures, 
which is again found to be algebraic. 
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1 Introduction 

A linear chord diagram consists of a line segment called its backbone to which 
are attached a number n > of chords with distinct endpoints. These combi- 
natorial structures occur in a number of instances in pure mathematics includ- 
ing hnite type invariants of knots and links pnil9] , the representation theory of 
Lie algebras [5] , the geometry of moduli spaces of flat connections on surfaces 
[2J[3], mapping class groups pQ and the Four-Color Theorem [5], and in ap- 
plied mathematics including codifying the pairings among nucleotides in RNA 
molecules |28j . or more generally the contacts of any binary macromolcculc 
[2"M2l?II5o] . and in the analysis of data structures [TU1ITT] . 

As the title indicates, this paper is dedicated to enumerative problems 
associated with linear chord diagrams. It is obvious that there are (2n — 1)!! 
many distinct linear chord diagrams with n chords. Partly because of their 
relevance to the number of finite type invariants of knots, which corresponds 
to a quotient of the collection of linear chord diagrams, sophisticated related 
enumerative problems have been studied in jT2lll8, 21, 33,371. In contrast to 
these, our approach depends upon a certain filtration of the collection of all 
linear chord diagrams as follows. 



Fig. 1 Equivalence between a pairwise identification of the sides of an octagon (left), the 
surface F(G) where all vertices of the backbone are collapsed into a single vertex (note we 
therefore have 2 — 2g — r = 1 — n) (middle) and the surface F(G) induced by a linear chord 
diagram G (right). 

Drawing a picture of a linear chord diagram G in the plane with its back- 
bone lying in the real axis and the chords in the upper half-plane determines 
a cyclic ordering on the half-edges of the underlying graph incident on each 
vertex, thus defining a corresponding "fatgraph" G to which is canonically 
associated a topological surface F(G) (cf. £)2.3[) of some genus; see Figure [T] 
It follows that to each linear chord diagram G is naturally associated the cor- 
responding genus of .F(G), and we let c g (n) denote the number of distinct 
linear chord diagrams with n chord of genus g with corresponding generating 
function C g (z) = J2 n >o c s( n ) z ™' f° r eacn 9 — 0- 

In particular, the Catalan numbers co(n), i.e., the number of triangulations 
of a polygon with n + 2 sides, also enumerate linear chord diagrams of genus 
zero. Their recursion is evidently given by c$(n + 1) = X)"=o c o(i) c o( n — i) 
with basis cq(0) = 1, which implies Cq(z) = J2n>o c o( n ) zn = 1 + z[Cq(z)] 2 , 



b 
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whence 



and so 



since 



l-vT= Tz 
C (z) = - 



c (n) 



2z l + Vl-4z' 

2n\ 1 (2n)! 



n y n + 1 (n + l)!n! 



n>l 



In fact, these numbers c g (n) had been computed in another generating 
function over two decades ago by Harer-Zagier [Tl] in the equivalent guise of 
the number of side pairings of a polygon with 2n sides that produce a surface 
of genus g, cf. Figure [TJ namely, 

n>0 2g<n v ; v 

a striking and beautiful formula which is the starting point for our compu- 
tations and for which we therefore also provide a novel proof in Lemma [1] 
depending only on the character theory of the symmetric group. Indeed, this 
formula was a crucial intermediate step for the calculation of the virtual Euler 
characteristic of Riemann's moduli space given in |14j and independently in 
[23| using a novel matrix model. 

The topological filtration of linear chord diagrams by genus discussed here 
was also considered in [5] , where enumerations for genus one and for maximal 
genus were obtained. Furthermore, there is physics literature initiated in [52] 
on RNA enumeration based on matrix models which relies on the genus of 
linear chord diagrams including the derivation of another expression for the 
c g (n) in terms of Lagucrre polynomials |34j and the comparison of expected 
with observed genera [71155]. 

Though the numbers c g (n) have thus been known in various forms for 
some time, the natural generating functions C g (z) have not been computed 
heretofore, and it is this that we accomplish here. In fact, we shall prove in 
Theorem Q] that for any g > 1 



C g (z)= P g (z) x 



(1 - 4z) 3 s ' 

where P g (z) is a polynomial defined over the integers of degree at most (3g— 1) 
that is divisible by z 2g with P ff (l/4) ^ 0. In particular and surprisingly, C g (z) 
is algebraic over C(z) for all g > 1 just as is the Catalan generating function 
Cq(z), which our results thus generalize. 

In fact, the polynomials P g {z) empirically have degree exactly 3g— 1 and all 
their coefficients are positive. These attributes of positivity and exact degree 
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have yet to be proved however. The first several such polynomials are given 
below. 



P 2 (z) = 21z 4 (z + 1) 

P 3 (z) = llz 6 (158 z 2 + 558 z + 135) , 

Pi(z) = 143z 8 (2339 z 3 + 18378 z 2 + 13689 z + 1575) , 

P 5 {z) = 88179z 10 (1354 z 4 + 18908 z 3 + 28764 z 2 + 9660 z + 675) . 



In light of these properties, it is natural to speculate that the polynomials 
P g (z) themselves solve an enumerative problem, and it is interesting in this 
vein to compare coefficients for P 2 and P 3 with fatgraph tables such as those 
in [20] . One might further speculate that there may be a purely combinatorial 
topological proof of Theorem [T] based on a construction that in some way 
"inflates" these structures using genus zero diagrams, to wit 



Motivated by enumerative problems for RNA, we also study a further class 
of combinatorial objects as follows. A "partial linear chord diagram" is a col- 
lection of chords attached to a backbone with n vertices, where we now drop 
the condition for a linear chord diagram that each vertex has an incident chord. 
Furthermore, two distinct chords with respective endpoints i± < ji and i 2 < j 2 
are "consecutively parallel" if i\ = i 2 — 1 < j 2 = ji — 1, consecutive parallelism 
generates the equivalence relation of "parallelism" whose equivalence classes 
are called "stacks" . A chord connecting vertices which are consecutive along 
the backbone is called a "1-chord" . 

A "macromolccular diagram of minimum helix length cr > 1 on n > 
vertices" is a partial linear chord diagram on n vertices with no 1-chords 
so that each stack contains at least a edges; let d SifT (n) be the number of 
all such partial linear chord diagrams of genus g with generating function 
D 9i „(z) = X)n>o d-g,<?{n)z n . We compute this generating function for g, a > 1 
in Theorem [5] to be 



c g (z) = p g (z)(VT^Tz) 



l-6g 





where u IJ {z) = z L_ z 2_i_ 1 ■ In particular, D s <T (z) is also algebraic over C(z), 
and for arbitrary but fixed g and 72 ~ 1.9685, we have 
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for some constant k g . 

The exponential growth rate of 1.9685 shows that the number of macro- 
molecular diagrams grows much more slowly than the number of RNA se- 
quences over the natural alphabet. This implies the existence of neutral net- 
works [TTII2W50] . i.e., vast extended sets of RNA sequences all folding into a 
single macromolecular structure. These neutral networks are of key importance 
in the context of neutral evolution of RNA sequences. 

2 Background and Notation 

We formulate the basic terminology and notation for graphs and linear chord 
diagrams, establish notions and notations for the symmetric group, recall the 
fundamental ideas and constructions for fatgraphs, and finally combine these 
ingredients for application in subsequent sections. 

2.1 Graphs and linear chord diagrams 

Let G be a finite graph in the usual sense of the term comprised of vertices 
V(G) and edges E(G), where edges do not contain their endpoints and are 
not necessarily uniquely determined by them; in other words, G is a finite 
one-dimensional CW complex. Removing a single point from an edge produces 
two components, each of which is called a half-edge of G. A half-edge which 
contains a vertex v in its closure is said to be incident on v, and the number 
of distinct half-edges incident on v is its valence. 

Let B = B n denote the closed interval [l,n] of real numbers between 1 
and n > 2 regarded as a graph with V(B) = B nZ and E(B) = {(i,i + 1) : 
i = 1, ... ,71 — 1}. We shall refer to the least and greatest elements of V(B) 
as extreme vertices, which are univalent in B, and to the other vertices of B 
as interior vertices, which are bivalent in B. A partial linear chord diagram C 
on 7i > 2 vertices is a graph containing B so that V(C) = V(B), each interior 
vertex of C has valence at most 3, and the extreme vertices have valence 
at most 2. B is called the backbone of C, edges in E(B) C E(C) are called 
backbone edges, and edges in the complement E(C) — E(B) are called chords. 

In particular, C is called a linear chord diagram if every interior vertex 
has valence exactly three and the extreme vertices have valence exactly two; 
in particular, the number of vertices for a linear chord diagram is necessarily 
even. 

Two distinct chords ei,e2 <E E(C) — E(B) in a (partial) linear chord dia- 
gram C with respective endpoints i\ < ji and 72 < ji are consecutively parallel 
if i\ = ii — 1 < ]2 = ji — 1. Consecutive parallelism generates the equivalence 
relation of parallelism on E(C), and equivalence classes are called stacks. A 
linear chord diagram in which every stack has cardinality at most one is called 
a shape. 

Let 3^(n) denote the collection of all partial linear chord diagrams on 
n vertices and ^(n), S^{n) the collections of all linear chord diagrams and 
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shapes on 2n vertices, respectively, so in particular, we have the inclusions 
&(2n) D ^(n) D S^in). There is a natural projection 

d : U n >i^(n) -> U„>iJ^(n) 

defined by contracting to a point any backbone edge at least one of whose 
interior endpoints is not trivalent or extreme endpoints is not bivalent as well 
as collapsing each non-empty stack onto a single chord, i.e., all least vertices 
of chords in a stack are collapsed to a single vertex and likewise all greatest 
vertices. 

Define a 1- chord in a (partial) linear chord diagram to be a chord connect- 
ing two consecutive vertices i and i + 1 in the backbone. 

f-chords are typically proscribed in the partial linear chord diagrams that 
arise in applications to RNA owing to tensile rigidity of the backbone. Further- 
more, stacks of small cardinality are typically energetically unfavorable, and 
one introduces a parameter a > 1 specifying the minimum allowed cardinality 
of a stack. A macromolecular diagram of minimum helix length a > 1 on n > 
vertices is a partial linear chord diagram on n vertices with no 1-chords so that 
each stack contains at least a edges; let @ a {n) denote the collection of all such 
partial linear chord diagrams. 

Our main results solve enumerative problems for linear chord diagrams, 
shapes, and macromolecular diagrams of fixed minimum helix length. 



2.2 Permutations 

The symmetric group S2n of ah permutations on 2n objects will play a key role 
in our calculations. We shall adopt the standard notation writing (ii, ■ ■ ■ , ik) 
for the cyclic permutation i\ t— > %2 i— > • • • i— > i k \— > i\ on distinct objects 
i\, . . . , ik and shall compose permutations tt, t from right to left, so that tt o 
r(i) = 7t(t(i)). An involution i is a permutation so that i o i is the identity. 

The conjugacy class of 7r e Sm will be denoted [tt]. Conjugacy classes in 
S2n are identified with classes of partitions of {1, . . . , 2n}, where in the stan- 
dard slightly abusive notation, tt £ [tt] — [l iri • • • 2n 7r2 "] denotes a partition 
comprised of tt^ > many parts of size k, for k = 1, . . . , 2n, i.e., tt is com- 
prised of 7Tfe many k cycles of pairwise disjoint supports, for k = 1, . . . , 2n, so 
necessarily Y2k=i = 2n. A permutation tt is an involution if and only if 
7r G [l iri 2 7r2 ], and it is fixed point free if and only if 7Ti = 0. The number of 
elements in the class [tt] is given by 

M = I [J I = —J± . 



The irreducible characters x of iS„ are labeled by Young tableaux F. See 
|31| for further details and background. 
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2.3 Fatgraphs 

A fatgraph G is a graph G together with the specification of a collection of 
cyclic ordcrings, called the fattening, one such cyclic ordering on the half-edges 
incident on v for each v € V(G). 

A fatgraph G uniquely determines an oriented surface F(G) with boundary 
as follows. For each v £ V(G), consider an oriented surface isomorphic to a 
polygon P v of 2k sides containing in its interior a single vertex of valence k 
each of whose incident edges are also incident on a univalent vertex contained 
in alternating sides of P v , which are identified with the incident half-edges in 
the natural way so that the induced counter-clockwise cyclic ordering on the 
boundary of P v agrees with the fattening of G about v. The surface F(G) is 
the quotient of the disjoint union \J V £V(G)Pv, where the frontier edges, which 
are oriented with the polygons on their left, are identified by an orientation- 
reversing homeomorphism if the corresponding half-edges lie in a common 
edge of G. This defines the oriented surface F(G), which is connected if G is 
and has some associated genus g(G) > and number r(G) > 1 of boundary 
components. 

The various trees in the polygons P v , for v G V(G), combine to give a 
graph identified with G embedded in F(G), so that we regard G C F(G). In 
fact, G is a deformation retraction of the surface F(G) by construction, so 
their Euler characteristics agree, namely, 

X (G) = W(G) - #E{G) = 2 - 2.9(G) - r(G) = x(F(G)) 

provided G is connected, where ^ denotes cardinality. 

A fatgraph G is uniquely determined by a pair of permutations on the 
half-edges of its underlying graph G as follows. Let vt > denote the number 
of fc-valent vertices, for each k > 1, . . . , K, where K is the maximum valence 
of vertices of G and 

K 

J2 kv k = 2#E{G) = 2n 
fe>i 

is the number of half-edges. The valencies of vertices of G are thus succinctly 
described by a permutation in the conjugacy class [1" 1 2' U2 • • ■ K'" K ]. 

In order to explicitly determine a permutation in this class, specify a linear 
order on V(G) as well as a distinguished half-edge incident on each v € V(G). 
This determines a unique linear ordering on the half-edges of G which restricts 
to the the fattening at each vertex, where the distinguished half-edge is least, so 
that one half-edge furthermore precedes another if it is incident on a preceding 
vertex. There is thus a well-defined permutation r € [l"^" 2 • • • K VK ] C S 2 „ 
whose disjoint cycles correspond to the fattenings at each vertex. 

The second permutation i £ S^n is the product i = Jl"=i C 1 ) ovcr a ll 
edges e S E(G) of disjoint transpositions (h, h'), where the distinct half-edges 
h, h' lie in the common edge e. Thus, whereas the permutation r determines 
the valencies of vertices, the fixed-point free involution i 6 [2 n ] determines the 
edges of G. 
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Several basic facts follow from this representation of a fatgraph G as a pair 
t,i£ S^n of permutations. One important point that is easy to confirm is that 
the boundary components of F(G) are in one-to-one correspondence with the 
cycles of r o t, i.e., 

r(G) = the number of disjoint cycles comprising toi 
= (t o 1)1 + (r o b) 2 H h (r o t) 2 „. 

Furthermore, isomorphism classes of fatgraphs with vertex valencies (vk)^ =1 
are evidently in bijection with conjugacy classes of pairs r, t G S^n, where 
t G [l Ul 2" 2 ■ ■ • K VK ] and t G [2"]. In particular, as a data type on the computer, 
fatgraphs are easily stored and manipulated as pairs of permutations, and var- 
ious enumerative problems can be formulated in terms of Young tableaux. 

See [2"3"I|2"5"] for more details on fatgraphs and [51 [H1[2771[2^1[TC] for exam- 
ples of fatgraph enumerative problems in terms of character theory for the 
symmetric groups. 

2.4 Fatgraphs and linear chord diagrams 

A regular planar projection of a graph in 3-space determines a corresponding 
fattening on it, namely, the counter-clockwise cyclic ordering in the plane of 
projection. The crossings of edges in the plane of projection can be arbitrarily 
resolved into under/over crossings without affecting the resulting isomorphism 
class. Furthermore, a band about each edge can be added to a neighborhood 
of the vertex set in the plane of projection respecting orientations in order 
to give an explicit picture of the associated surface embedded in 3-space. 
An example with a single 8-valent vertex is illustrated in the middle of Fig- 
ure [TJ this fatgraph G can be described by the pair r = (1,2,3,4,5,6,7,8), 
l = (1,5)(2,3)(4,7)(6,8), and indeed, the cycles of tol = (1, 6)(2, 4, 8, 7, 5)(3) 
correspond to the boundary components of F(G), which has Euler character- 
istic -3, r(G) = 3, and g(G) = 1. 

The standard planar representation of a (partial) linear chord diagram C 
represents the backbone as a real interval and non-backbone edges as semi- 
circles in the upper half plane as in the example on the bottom-right in Figure 
[TJ This planar projection thus implicitly determines the canonical fattening C 
of C as above. An example is given on the top-right in the figure with corre- 
sponding permutations given by r = (1, 2)(3, 4, 5)(6, 7, 8) ■ • • (18, 19, 20)(21, 22), 
l = (2, 7)(4, 13)(10, 21)(16, 19), where vertices are ordered left to right and 
rightmost backbone half-edges are distinguished to determine the linear or- 
dering on half-edges. 

Given a (partial) linear chord diagram C with its corresponding fatgraph 
C, consider the graph G arising from C by collapsing its backbone to a single 
vertex together with its fattening G induced from C in the natural way; for 
example, the fatgraph G in the middle of Figure [T] arises in this manner from 
the fatgraph C on the top-right. We claim that the surfaces F(C) and F(G) 
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have the same genus and number of boundary components, and indeed, it 
follows by construction that the two surfaces are homcomorphic and hence 
have the same invariants. 

In particular, a linear chord diagram C on 2n vertices gives rise to a fat- 
graph with a single vertex of valence of 2n and a distinguished half-edge 
(namely, the one coming just after the location of the collapsed backbone), 
i.e., a pair of permutations r G [2n], i G [2™]. We may thus define the number 
of boundary components and genus of a linear chord diagram 

r(C) = r(Gc) = the number of disjoint cycles comprising r o t, 
g{C) = g{& c ) = \{n + l-r(C)). 

Conversely, the specification of a pair t G [2n], l G [2™] uniquely determines 
a linear chord diagram on 2n vertices. Equivalently, the enumeration of pairs 
r G [2n], i G [2™] corresponds to all possible edge-pairings of a polygon with 
2n sides, as illustrated for the ongoing example on the left in Figure [1] Sum- 
marizing, we have the following: 

Proposition 1 The following four sets are in bijective correspondence 
^(n) = {chord diagrams on 2n vertices}, 

{univalent fatgraphs with n edges and a distinguished half — edge}, 
{edge — pairings on a polygon with 2n labeled sides}, 
{pairs t, l G S 2n ■ r G [2n] and i G [2"]}. 

Our first counting results will rely upon the bijection between ^(n) and 
pairs of permutations in Sin established here. The subsequent more refined re- 
sults are tailored to the macromolecular diagrams of interest in computational 
biology. 



2.5 Topological Filtrations and Generating Functions 

Let ^gin) 3 S* g (n) denote the collections of all linear chord diagrams and 
shapes of genus g > on 2n > vertices with respective generating functions 

C g {z) = Y^c g {n)z n , 

n>0 
n>0 

where c g (n) = s g (n) = if 2g > n since r = n + 1 — 2g has no positive solution 
r > unless 2g < n. 
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Likewise, let ^ g (n, m) D ^(n, m) denote the collections of all linear chord 
diagrams and shapes of genus g > on 2n > vertices containing m > 
1-chords with respective generating functions 

c g (x,y) = c g (n,m)x n y m , 

rn , n > 

S g (x,y)= ^ s g (n 7 m)x n y m 7 

m,n>0 

where c g (n, m) — s g (n, m) = if 2g > n or if m > n. 
Notice that the projection ■& restricts to a surjcction 

•d : \J n > %(n,m) -> U„>o^ 9 (n,m) 

which collapses each stack to a chord and therefore evidently preserves both 
the genus g and the number m of 1-chords. For any shape 7 € U n >o^(n., m), 
let 

^ 7 (n, m) = <*f(n, m) n i?~ ^7) 

denote the intersection with the fiber •d~ 1 ( r y) with its generating function 
C 7 (x,y). 

Turning finally to macromolecular diagrams, let ^ SjCr (n) denote the subset 
of S$a{n) comprised of diagrams with genus g and let 

n>0 

denote the corresponding generating function. Again, the projection •& restricts 
to a surjection 

■d : U„> ^ SlO -(n) -> U n > y g (n), 
which preserves the genus. For any shape 7 G \J n >oS^(n), let 

%, a {n) =^(n)n^ 1 ( 7 ) 

denote the intersection with the fiber •d~ 1 (p[) with its generating function 
D 7j0 .(z). 

As a general notational point for any power series P(z) = X] a i z% : we shall 
write [z l ]P(2;) = a.; for the extraction of the coefficient as of z 1 . 

3 The generating function of linear chord diagrams of genus g 

We introduce the polynomial 

P(n,x) = c s (n)-x" +1 - 2 «, (2) 

{3|2g<n} 

which plays a key role in our computation of C g (z). 
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Lemma 1 Letting x, z denote indeterminates, we have 

~ P(n,x) +1 _ (1 + zY , 3) 

Proof Since the number of boundary components r = n + 1 — 2g of the fat- 
graph corresponding via Proposition [T] to a pair r, t of permutations equals 
the number of disjoint cycles comprising r o i, we compute 



P(n,N)= Y c g (n) N n+1 - 29 = NT,ATLh 

{g\2g<n} te[2"] 

= Yn^ y 1 



t £[2»] 
"E[,] 



E^ EiT< E *w,p»]-*i 



TCT , 7T 7 



where r S [2n] is any fixed permutation and S denotes the Kronecker delta 
function. 

We claim that 

(2n-l)!! v-X r ([2"])x y Wx y ([2n]) 



o-es 2 „ 



11,/" -.. 



E 



x r ([1 2„i 



(4) 



To this end, the orthogonality relations 



Y, \ Y I \ ( 2n V- x 
v 1 1- 



of the second kind give 

E ^HP"] • ^tHM = E 



I [2"] I 
(2n)! 



(2n-l)!! 
ll,.r 



^X^(a) X y ([2"]) 

Y 

Yx Y ([2 n l)x Y 'M 



(2n) 



The slight variant 



• <5 



X y ([l 2 "]) 

of the orthogonality relations of the first kind thus gives 



Y.Y' 



[<x],[2"] ^[raUw] 



X Y ([2n}) 

x Y ([i 2n }) 



(2n-l)!! ^ x y ([2 n ])x y Wx Y (N) 

II .r •*>•! 



E 



x r ([l2 „i 
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since r € [2n] completing the proof of (0]). 

Plugging this into our expression for P(n, N), we find 

%*i = (*-»^ ? Wh £ "^vm (•) 

V v ' 

(*) 

since 

W J - -es 2 „ INI (n,.7" J •^!J 

(2n)! V 

Rewriting the factor 

7Tj \h=l I 

as a product of power sums Pi(x\, . . . ,xn) = J2h=i x \> we identify via the 
Frobenius Theorem [27] the term (*) in cq. ([5]) as a special value of the Schur 
polynomial sy (xi, . . . , Xjv) of Y over N > 2n indctcrminatcs, namely, 

Mi, • • • , i) = 7^1 E x y W n»(i. - . i)" = T^yy E II f E x y «. 

(6) 

We use the Murnaghan-Nakayama rule [3T] 

X Y ((h,...,i m )cr)= E (-r^'x^W (7) 

skew hook of length tti 

to explicitly compute the remaining character values, where wiY^) equals the 
number of rows in the skew hook minus one. Let Y p ^ q denote a (p, g)-hook 
Young diagram having a single row of length q + 1 > 1 and p rows of size one, 
where p + q + 1 = 2n. It follows that x y ■ • ■ , in)) — (~ l) p <5yy since for 
a = (ii, . . . , i n ) the only skew hook of length 2n is a hook of length 2n itself. 
Since r 6 [2n], this implies that only Young diagrams Y PtQ contribute to the 
sum in eq. (J5J), and setting x p ' q — X Yp,q , we arrive at 

Eq. (J7| furthermore implies 

r yP , 9 -» l((j) + ( _ ir -i x P-™, g((T) . form< 2n, 
X P ' 9 (^l,...,l m )cr) = < (9) 

(— l) p ; for m = 2n, 
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where x p m,q an d X p,q ™ are zer0 m the respective cases p — m < and 
q — m < 0. This recursion allows us to prove via induction 

fC-l^CV 1 ); forp = mod 2, 

X M ([2"]) = u A? n -u , 1 , 9 (10) 
l( — 1) 2 (p^i); forp = l mod 2, 

xPi9([l2n])= /2n-l\ (n) 



q 

Now, according to the definition of Schur polynomials, we have 

s x (x 1 ,x 2 , ...,x n ) = ^2x T = ^2x[ 1 - --x^, (12) 

T T 

where the summation is over all semistandard Young tableaux T of shape A, 
and U counts the occurrences of the number i in T. Thus, s Pi<? (l, . . . , 1) counts 

N 

the number of semistandard Young tableaux T of shape (<?+ 1, 1, . . . , 1) whose 

p 

contents are integers not larger than N, which we next compute. If the first 
element in the first row is i, where 1 < i < N —p, then there are ( ~') ways to 
arrange the numbers in the first column. Since the first row weakly increases, 
the remaining q elements in the first row can be chosen from i to N with 
repetition. There are thus ( ways to choose q elements in the first row. 

We conclude that the number of desired semistandard Young tableaux is 

N ~ p 7 iV -i\(N + q- i\ (N + q \ (2n - 1 



and hence 

Consequently, we arrive at 

" 'N + 2n - 2j - 1\ fN + 2n- 2j - 2 



(2n-l)!! f£ ) \ j 



2n \ 2n 



( n _ 1\ 1 f (l + x \N+2n-2j-l M , \N+2n-2j-2 



i=o \ ■> / •> 



27TZ , 
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where z = (1 + 2a:)- 1 . We have thus proved 2 ff^l, = [z n+1 ] (j^) 
cquivalcntly 

- P(n,N) fl + z^ 

For convenience, we now introduce 

P(n,N) 



N 

, or 



p(n,N) = 



(2n-l)!!' 
and claim that 



l + 2^p(n,x)z" +1 = (15) 



n=0 



for an integral indeterminate X, i.e., we now drop the restriction that N > 
2n. Of course, p(n,x) is determined by {p(n, A^) | Af > 2n} since any two 
polynomials that coincide on an arbitrarily large set must be identical. The 
identity 

'i _i_ \ N /i j- N N ~ 1 

(1 + 2z + 2z 2 + 2z 3 + ...)' 



l-z J K ' \ 1 -z, 

gives 

p(n,N) = l+p(n,N-l)+2(p(n-l,N-l)+p(n-2,N-l)+---+p(l,N-l)), 
whence 

p(n, N) - p(n — 1,N)= p{n, N - 1) - p(n - 1, N - 1) + 2p(n - 1, JV - 1), 

from which it follows that 

p(n, N) = p(n, N - 1) + p(n - 1, N) + p(n - 1, N - 1), for all N > 2n, 

where p(0, N) = N and p(n, 0) = 0, for n, N > 0. This recursion guarantees 
the identity of polynomials 

p(n, x) = p(n, x — 1) + p{n — l,x) + pin — 1, X — 1), 

where p(0, x) = a; and p(n, 0) = 0, for n > 0. At the same time, 

•l + z N " 

n=0 



1 



l + 2^fe(n,a;) 



gives b(n, x) = b{n, x — 1) + — 1, x) + 6(n — 1, a; — 1), where &(0, x) = x 
and 6(n, 0) = 0, for n > 0. Thus, the two sides of eq. ([T5j) satisfy the same 
recursions and initial conditions, guaranteeing their equality, as was claimed, 
completing the proof of the lemma. 
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Lemma 2 The c g (ri) satisfy the recursion 

(n + 1) c g (n) = 2(2n - 1) c g (n - 1) + (2n - l)(n - l)(2n - 3) c g _i(n - 2^0) 
where c g (n) — /or 2g > n. 
Proof On the one hand by definition. 

— p(n, x)z n+1 = ^2 ( n + l)K n > a; ) z " 

n>0 n>0 

ti>0 2g<n v ' 

so the coefficient of x n + 1 ~ 2 a z n \ s L!i±^gW , On the other hand by eq. (fT5|) . 



n>0 



^(1 + 21^^ 

n>0 

c fl (x) 



+11 



V 



V ^ (2n- 1)!! ; ^ 

n>02g<n v ' j>0 

has 2 J3?_ (2( 8 w-i-"2j)^i)!! as ^ S coefficient of .t" +1 ~ 2£, z 9 . Equating these two 
coefficients, we obtain 

(n + l)c g (n) = c g -j(7i- l-2j) 

(2n-l)!! Z.( 2 (n-l-2j)-l)!!' 

and hence 

(n + l)c g (n) _ (n - l)c g _i(n - 2) 
(2n-l)!! (2(n-2)-l)!! 

= 2 c g _j(n- 1 - 2j) _ „ y> c g _i_j(n - 3 - 2j) 
( 2 ( n _ i _ 2 j) - 1)!! ^ (2(n - 3 - 2j) - 1)!! 

n eg!"- 1 ) 



(2(n-l) - 1)!!' 

as required. 

Theorem 1 For any g > 1 i/ie generating function C g (z) = X/n>0 c g( n ) zn * s 
given by 
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where P g (z) is a polynomial with integral coefficients of degree at most (3<? — 
1), P g (1/4) ^ 0, [z 2 a]P g (z) ^ and [z h ]P g (z) = for < h < 2g - 1. 
In particular, C g (z) is algebraic over C(z) and has its unique singularity at 
2=1/4 independent of genus. Furthermore, the coefficients ofC g (z) have the 
asymptotics 

r(3 fl -i/2)- 



[z n ]C g (z) ~ r,^. , /n ^ n 3g ~U n . (18) 



Proof The recursion eq. (JTHJ) is equivalent to the ODE 



z(l - 4z) — C s (z) + (1 - 2z)C 9 (z) = #g_i(z), (19) 



where 



^-i(^) = z 2 ^4z 3 ^C g _!(z) + 24z 2 ^C 3 „ 1 (z) + 27z^C 9 _ 1 (z) + 3C g _!(z) 

with initial condition C ff (0) = since r = n + 1 — 2g has no positive solution 
r > for n < 2g. 

We prove the theorem by induction on g. For the basis step g = 1, we have 
the generating function Co(z) = 2 (l + yl — 4 z) for the Catalan numbers. 
The Picard-Lindelof Theorem [T5] guarantees the unique solution to eq. (|19D , 
satisfying Ci(0) = 0, given by 



Ci(z) = ^Vl-42. (20) 

(l-4z) 3 

For the inductive step, the induction hypothesis gives that for any j < g, 
we have 

where Pj(x) is an integral polynomial of degree at most 3j — 1, P,-(l/4) ^ 0, 
[z 2j ]Pj{z) ^ 0, and [z h }P 3 (z) = for < h < 2j - 1. The general solution of 
eq. ([15} is 



where 



= 4z 5 ^C s (z) + 24z 4 ^C 3 (z) + 27z 3 ^C 9 (z) + 3z 2 C 9 (z) 
gi(g) 

(1 _ 4z )3 ff +5/2- 

We claim that 



• Q g (z) is a polynomial of degree at most 3<? + 2, 

• Q s (l/4) ^ 0, and 
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• [z 2 3+ 2 ]Q g (z) ^ and [z h ]Q g {z) = if < h < 2g + 1. 
To these ends and by the inductive hypothesis C g (z) = P g (z)/(1 — 4z) 3£,_1 / 2 , 



4z)P g (z) + (12 ff -2)P g (z), 
4z)P{ g (z) + (12g + 2)P lg (z), 
iz)Pi g (z) + (12g + 6)P 2g (z). 



3 (z) = 4z 5 P 39 (^)+24z 4 (l-4z)P 2s (z)+27z 3 (l-4z) 2 F lg (z)+3z 2 (l-4z) 3 P g (z). 

(22) 

To see that Q g {z) is indeed a polynomial of degree at most 3g + 2, first 
note that P g (z) and each Pi g (z), for 1 < i < 3, are each polynomials of degree 
at most 3g — 1, so the degree of Q g (z) is at most 3g + 4. We shall compute the 
coefficients [z 3g+3 }Q g (z) and [z 3s+4 ]Q g (z) in terms of d 3g ^ 2 = [z 3g - 2 ]P g {z) 
and d 3g _i = [z 3s_1 ]P g (z), where we find 



we have 

- (l-4z)W whcrC Pl « (z) - (1 
d 2 C g (z) _ P 2g (z) 

-^2" - (i_4z)W whcrc P ^ z > V 
d 3 C g (z) P 3g (z) 

-d^T" = (1-4^/2 ' whCTC P ^ = ^ 
Thus, 



[z 3s - 1 ]P lg (z) = 2d 3ff -i, 

[z 3 ^ 2 ]P lg (z) = (3.g - l)d 3g _i + 6d 3g - 2j 

[z 39 - l ]P 2g (z) = 6[z 3 ^ 1 ]P lg (z) = 12d 3fl -i, 
[z 3 ^ 2 ]P 2g (z) = (3.g- lMz^P^z) + 10[z 3 ^ 2 ]P lg (z) 
= 12(3.9- l)ds fl -i +60d 3s -2, 

[z 39 - l ]P ig {z) = 10[^ 3ff - 1 ]P 2s (z) = 120d 3fl -l, 
[z 3 ^ 2 ]P 3g (z) = (3.9 - l)^" 1 ]^*) + 14[z 3 ^ 2 ]P 2g (z) 
= 180(3.9 -l)d 3g _i+840d 3g _ 2 , 



Plugging these into eq. (f2"2"f. we compute 



[z 3 *+ 4 ]Q g (z) = 4[z 3 ^ 1 ]P 3g (z) - 24 x 4 x [z 3 ^]P 2g (z) 

+27 x (-4) 2 x [z 39 - l ]P lg {z) + 3 x (-4) 3 x d 3g _! 

= 0, 
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and 

[z 3 v +3 ]Q g (z) = 4[z 3 ^ 2 ]P 3g (z) + 24[z 3 °- 1 ]P 2g (z) - 24 x 4 x [z 3 ^]P 2g ( z ) 
+27 x (-8) x [z^]P lg { z ) + 27 x (-4) 2 x {z 3 °- 2 ]P lg (z) 
+3 x 3 x (-4) 2 x d 3g _i + 3 x (-4) 3 x d 3g - 2 

= 4 x (180(3 5 - 1) (kg-! + 840 d 3s _ 2 ) + 24 x 12 d 3ff -i 
-96 x (12(3 5 - 1) d 3g _i + 60 d 3ff _ 2 ) 
-27 x 16 d 3ff _i + 27 x 16 x {(3g - 1) d 3s _i + 6 d 3ff _ 2 ) 
+ 144d 3ff _ 1 -192d 3g _ 2 

= 0. 

Furthermore, insofar as P g (l/4) ^ 0, we have P, s (l/4) ^ 0, for 1 < i < 3, 
and hence also Q g (l/4) ^ from eq. (f2"2")l as was claimed. 
We finally show 

[z 2g+2 ]Q g {z) ^ and [z h ]Q g {z) = 0, for < h < 2g + 1. 

By the induction hypothesis, we have [z 2g ]P g (z) ^ and \z h ]P g (z) = 0, for 
< h < 2g. By definition of Pi g , P 2g , p3 g , we have, for g > 2, 

lz 2 °- 1 }P lg (z) = 2g{z 2 v}P g (z), 

[z 29 - 2 )P2 3 (z) = (2.g - l)[z 2 ^]P lg (z) = (2g - l)2 5 [z 2 *]P g (z) 
lz 2 °- 3 ]P 3g (z) = (2g 2)[z 2g ' 2 ]P 2g (z) = (2 5 - 2)(2g - l)2 ff [z 2 *]P g (z), 

and consequently conclude 

[z 2g+2 ]Q g (z) = A[z 2 v~ 3 ]P 3g {z) + 2A[z 2 ^ 2 ]P 2g {z) + 27[z 2 ^ 1 ]P lg (z) + S[z 29 ]P g (z) 
^0 

as was asserted. 

We proceed by extracting the following coefficients: 

[z h ]P lg {z) = (h+ l)[z h+1 ]P g {z) Ah[z h ]P g {z) + (12g 2)[z h ]P g (z) 

= 0, for < h< 2g - 1, 
[z h ]P 2g (z) = (h + l)[z h+1 }P lg (z) - 4h[z h ]P lg {z) + (12.9 + 2)[z h ]P lg (z) 

= 0, for < h < 2g - 2, 
[z h ]P 3g (z) = (h + l)[z h+1 ]P 2g (z) - Ah[z h ]P 2g (z) + (12.9 + e)[z h ]P 2g (z) 

= 0, for < h< 2g - 3. 

For < h < 2g+ 1, we therefore conclude 

[z h ]Q g (z) = 4[z h - 5 ]P 3g (z) + 24[z"- 4 ]P 29 (z) - 96[z fe - 5 ]P 2ff (z) 

+27[z ;i - 3 ]P l9 (z) - 27 x 8[z h - 4 ]P l9 (z) + 27 x 16[z h - 5 ]P lg (z) 
+3[z'- 2 ]P g (z) - 36[z^ 3 ]P 3 (z) + 144[z^ 4 ]P 9 (z) - 192[z'- 5 ]P g (z) 
= 



Enumeration of linear chord diagrams 



19 



as required, completing the verifications of our assertions about Q g . 

Now, since Q g (z) is a polynomial of degree at most 3g + 2, the partial 
fraction expansion is given by 

Qg( z ) _ A j 

(1-Azf^ ^(i-4 Z y [ ' 

where the Aj € Q. In light of eq. (f2"Tj) . we then compute 



, g+1 {z) = y 3 - + C 




3 9 +4 

, Aj . (1-4 z)° s ^ + C (1 - 4 z) 
2 4(i-l)^ ^ k ' j(l-4z) 39+ 



n /i „\3s+4-j , n A ,n3 9 +3 1 VI - 4; 



where the initial condition C g+ i (0) = evidently guarantees C = X^'=2 4( 3 -i) ■ 
Introducing the Laurent polynomial 



1 / 39+4 4- 33+4 4 

\J=2 J 3=2 J 



l-4z) 39+3 



(24) 
we claim that 



• P g +x(z) is a polynomial, i.e., [z 1 }P g +i(z) = 0, of degree at most 3g + 2, 

• P fl +i(l/4) ^0, and 

• [z 2 a+ 2 ]P g+l (z) ^ and [z h ]P g+1 {z) = if < h < 2g + 1. 
To these ends, we have 

W - g ^ - *) + g £ ( 3 f + + f) . ,25, 

In particular, [z^Pg+iO) = £ 3 £+ 4 jg^+T, 3 /^ 477=1! = 0, whence P g+1 {z) 
is indeed a polynomial of degree at most 3g+2. Furthermore, we find P s +i(l/4) = 
A 3g+4 /(3g + 3) ^ directly from eq. (J23J). 

It remains to show by induction on s that [z s ]P g+ i (z) = 0, for < s < 2g+l 
and [z 2a+2 ]P g+ i{z) ^ 0. We have seen that the coefficients [z 2a+2 ]Q g (z) ^ 
and [z h ]Q g {z) = 0, for < h < 2g + 1, that is, from eq. ([2"5]). 

and 



2.9 + 2 

(_ 4 )fc y V ~ J U -=0, for 0</!< 2.9+1. (27) 



39+4-ft / 

35 + 4 - j 



J=2 
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It follows from eq. (j2"T)l that 



3g+4 A 3g+4 ^ 



[z°)P g+1 (z) =^2^(3g + 4-j)+^2 -^t(3.9 + 3) 

i — ' 7 — 1 * — ' 7 — 1 

3=2 J 3=2 J 

3g+4 
3=2 

= o, 



and we hence assume by induction that [z s ]P g +i(z) = for < s < 2g + 1. 
To compute [z s+1 ]P g+ i(z), we thus assume 

J — 2 j — 2 

(28) 

i-e., EjS* & = m 4 -d compute 



- ■ l, j - 1 v 8 +i A fl + 2 j 




so that 



(-4) 



-< s+1 > [z s+1 ]P ff+ i(z) 



3g+4 

£ 

3=2 



A, 



s + 2 



3g + 4 - 
s + 1 



(29) 



according to eq. (f2~T|) . completing the inductive proof that indeed the coeffi- 
cients [z s ]P g+ i(z) = vanish, for < s < 2g + 1. 
Similarly, using 

J— 2 j— 2 

(30) 
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and eq. (J2HJ) , it follows that the coefficient 



,1, (:>,, • 1 A 3 4^ 4 A j f.U, ■ :', 



2 j - 1 V 2. 9 + 3 J \2g + 3 

39+4 - Aj f3g + A-j 



( _ 4) -( 29+2 )^ +2]Ps+l(z) = E f£L^ + ~ ' j + J2 _i_ 

j — ^ 

3 ^ 4 -A ■ 
j _ i 

E 



2 J - 1 V 2. 9 + 2 J V 2 3 
1 A,- /3 5 + 3\/.9 + l 



J-1V2.9 + 2/ V2ff + 3 

£j 2 9 + 3V 2 s + 2 ; ^ 

completing the verification of our claims about P g +i- 

To complete the proof, we must finally show that P g (z) has integral coef- 
ficients, and this follows immediately from the expression 

69-1 

P g (z) = C g (z)(VT^) 6g ^ = C g (z)(l-2zC (z/ 

since the right-hand side evidently has all integral coefficients. 
Corollary 1 We have the explicit expressions 

, . 2™- 2 (2n- 1)!! „ 
C ^= 3(n-2)! ' f ° r ^ 2 ' 

. , 2™- 4 (5n- 2)(2n - 1)!! r 
C ^= 90(n 4)! > 

, , 2"~ 6 (35n 2 - 77n+ 12) (2n- 1)!! „ 
C3in) = 5670(n-6)! ' " * 6 ' 

Proof Differentiating the formula for C±(z) using the explicit expression for 
Pi(z) in the Introduction, we get (1 — 4z)zCi(z) — (2 + 2z)C%(z) — giving 
the recursion (n — 2)ci(n) = 2(2n — l)ci(n — 1), whose solution is given by 
the asserted expression for Ci(n). As to 02(71) and again using the explicit 
expression for P-z{z) in the Introduction, observe that C2(z) satisfies the ODE 

5z 2 (4z - + 2 <^z + 11)^^ + 2(3z - U)C 2 (z) = 

dz z dz 

giving the recursion (5n 2 — 27n + 28)02(77) = (20n 2 — 18n + 4)c2(n — 1), which 
gives the asserted formula. For genus 3 we similarly get the ODE 

(140z 4 -35z 3 ) d3C ^V (462z 3 -252z 2 ) d2C3 J z) +(482 2 -684z)^^-(60z-744)C 3 (z) = 
dz 6 dz 1 dz 

which gives the recursion 

(35n 3 - 357?i 2 + IOO677 - 744)c 3 (n) = (140n 3 - 378n 2 + 202ti - 24)c 3 (n - 1) 
solved by the stated formula for 03(71). 
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Corollary 2 We have the explicit expression 

(45)! 



c s (2 5 ) 



Furthermore, the exponential generating function of these numbers is given by 



v 2g s/T+2x - y/l-2x 



(2a)! 2x 

g=0 V y > 

Proof According to the recursion eq. (|16p . we have 

(2g + l)c g (2g) = (2g - l)(4g - 1)(4 5 - 3)c 3 „ 1 (2.g - 2) 

since c g (2g — 1) = from Theorem [TJ The asserted formula is the unique 
solution of this recursion with initial value ci (2) = 1. For the second assertion, 
we compute 

v x2g -V (4g)! ( x \ 2g 

g =o a9 ( 2 -9) ! ^(2<?)!(2 5 + l)! V2/ 
_ C(f) + C(-f) 



_ s/l + 2x- v 7 ! - 2x 
~~ 2~x 

as was claimed. 

The formulas for c g (2g) and C\(n) were given in [5]. In fact and as illus- 
trated in the proofs, our methods provide a framework for computing explicit 
expressions of c g (n), for any g. 



4 Macromolecular diagrams of genus g 

We extend the enumerative results of Theorem [T] to macromolecular diagrams 
by first specializing to shapes and then modifying shapes to produce macro- 
molecular diagrams. 

Lemma 3 Suppose g is a non-negative integer. Then 

Sg{z , U ) = 1 + 2 C g ( , Z{l + Z) \ . (31) 
9V ' l + 2z-zu 9 V(l + 2z - zuf J K 1 



Enumeration of linear chord diagrams 



23 



Proof We first prove 

c ^ = 7rh^ c '{ ( X+ i- y ^ ) (32) 

and to this end, choose £ G ^(s-f 1, to+ 1) and label one of its 1-chords. Since 
we can label any of the (m + 1) 1-chords of £, (m + 1) c s (s + 1,to+1) different 
such labeled linear chord diagrams arise. On the other hand, to produce £ with 
this labeling, we can add one labeled 1-chord to an element of ff g (s, m + 1) by 
inserting a parallel copy of an existing 1-chord or by inserting a new labeled 1- 
chord in an element of ^(s, to), where we may only insert the 1-chord between 
two vertices not already forming a 1-chord. It follows that we have the recursion 

(m + 1) c g (n + l,m + 1) = (m + 1) c g (n 1 m + 1) + (2n + 1 — to) c g (n,m) 

or equivalently the PDE 

dC g (x,y) _ dC g (x,y) 2 dC g (x,y) dC g (x,y) 

dy ~ X dy +2X dx + x< -9^V)-xy g y , (33) 

which is thus satisfied by C g (x,y). 
On the other hand, 

= — 7 C * fr - TT~ vi 

a; + 1 — yx \ [x + 1 — yx)' 

is also a solution of eq. (f3"3"| . which specializes to C g (x) = C*(ar, 1), and 
moreover, we have c*(n, m) = [x n y m ]C*(x, y) = 0, for m > n. Indeed, the 
first assertion is easily verified directly, the specialization is obvious, and the 
fact that y only appears in the power series C*(x, y) in the form of products xy 
implies that c*(n, m) = 0, for m > n. Thus, the coefficients c*(n, to) satisfy the 
same recursion and initial conditions as c g (n, to), and hence by induction on 
n, we conclude c*(n, to) = c s (n, to), for n, to > 0. This proves that C s (n, to) 
indeed satisfies eq. (|3"2j) as was claimed. 

To complete the proof of eq. ([3~Tj) . we use that the projection ■& is surjective 
and affects neither the genus nor the number of 1-chords, namely, 

C g (x,y) =Y1 c y( x ,y)- 

771>0 T having genus g 
— and m 1-chords 

Furthermore, if a shape 7 has s chords, of which t are 1-chords, then we 
evidently have 



C 7 (x,y) = y 



which shows that C 7 (x, y) depends only on the total number of chords and 
number of 1-chords in 7. Consequently, 

c ^y) = E E c ^y) = E E s ^ m ) {^)y m = ^ 

m>0 7 having genus g S>0 771 — 

— and m 1-chords — 

(34) 
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Setting z = i.e., x = yqrj, and u = y, we arrive at 

1 + z ( z(l + z) 



; l + 2z-«* B V(l + 2«-«ii)V' 
as required. 

Lemma 4 For any shape 7 wii/i s > 1 chords and m > 1- chords, we have 



^ Z ' ) 1 ((1-z 2 )(1-^) 2 -(2z-z 2 )z 2ct ) 



In particular, D 7i<T (z) depends only upon the number of chords and 1-chords 
in 7. 

Proof We shall construct U„>rjD 7j(T (n) with simple combinatorial building 
blocks. As a point of notation and as usual, if X = U n >oX(n) is a collection of 
sets of partial matchings on n > vertices, then we consider the correspond- 
ing generating function X(z) = Yl n >o x ( n ) 2 "- I n particular, we have the set Z 
consisting of a single vertex with generating function Z(z) = z and the set 3? 
consisting of a single arc and no additional vertices with generating function 
R(z) =z 2 . 

Let = denote set-theoretic bijection, + disjoint union, x Cartesian product 
with iteration written as exponentiation, J the empty set, and Seq(X) = 3 + 
X + X 2 + ■ ■ ■ , for any collection X. 

Define the set L = Seq(Z) consisting of any number n > of isolated 
vertices and no chords, with its generating function L(z) = 1/(1 — z), and the 
set 3C 7 comprised of a single stack with at least a > 1 arcs and no additional 
vertices, with its generating function K a (z) = z 2cr /(l — z). 

The collection W = X" X (z X L + Z X L + (Z X £) 2 ) of all single stacks 

together with a non-empty interval of unpaired vertices on at least one side 
thus has generating function 



1 — z 2 \ 1 — z \1 — z J I 

Furthermore, the collection JA" = X a x Seq(3\T' t ) of all pairs consisting of a 
stack X a and a (possibly empty) sequence of neighboring stacks likewise has 
generating function 

M"( 2 '- K ° <2) - ^ 



where only intervals of isolated vertices as are necessary to separate the neigh- 
boring stacks have been inserted in M CT . 

To complete the construction and count, we must still insert possible un- 
paired vertices at the remaining 2s + 1 possible locations, where there must 
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be a non-trivial such insertion between the endpoints of each 1-chord. These 
insertions correspond to £ 2 »+i— m x (Z x L) m , and we therefore conclude that 
Lln^oJV") = (M* 7 ) 5 x £ 2s + 1 ~ m x (Z x L) m has the asserted generating func- 
tion 



D TjCT (z) = 



V 



l-z 2 \ ' l 1-2 



1 - Z 



2s+l-?n 



1 - Z 



(1 ^ r '((T^KT 



..2a 



- z) 2 - (2z- z 2 )z 2 ° / 
Our main result about macromolecular diagrams follows. 
Theorem 2 Suppose g and a are positive natural numbers and let u a (z) = 

t 2\<T— 1 

z it_ z s _]_ x ■ Then the generating function D ffi<7 (z) is algebraic over C(x) and 
given by 



u a {z)z 1 



u a (z)z 2 -z + l 9 \{u a {z)z 2 - z + if ) 
In particular, for arbitrary but fixed g and 72 ~ 1.9685, we have 

[z n ]D ga {z)^k g n^-h)^^ 

for some constant k g . 



(35) 



(36) 



Proof Since each element !£>g :<T (n) projects to a unique shape 7 with genus g 
and some number to > of 1-chords. we have 



(37) 



m > T having genus g 
and m 1-chords 



According to Lemma 01 D 7 <T (z) only depends on the number of chords and 
1-chords of 7, and we can therefore express 



1 



■P-11 



2-1 y \(1 - z 2 ){\ - z) 2 - (2z - z 2 )z 2 
1 / z 2 u a (z) 



(1 - z) + u a {z)z 2 \[(\ - z) + u a {z)z 2 ) J 

using Lemma 13] in order to conhrm cq. (|35|) . where the second equality follows 
from direct computation. Let 



z 2 u a (z) 



{{l-z) + u a {z)z 2 Y 



denote the argument of C„ in this expression 
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Since any algebraic function is in particular D-finitc as well as Z\-analytic 
|32j . we conclude from Theorem Q] that 

C g (z) = x g (l-Az)- {39 - 1 ^(l + o(l)) for z^r 1/4, (38) 

for some constant x g . Since C g (z) is algebraic over K = C(z), there exist 
polynomials Ri(z), for i = 1, . . . , £, such that Y^\=i Ri( z ) C g (z) % = 0, whence 
Ei=i Ri{0 a {z)) C g (6 a (z)) 1 = as well. Setting L = C(8 a (z)), we thus have 

[L(C g (e a (z))) : K] = \L{C g {8 a {z))) : L] ■ [L : K] < oc, 

i.e., ~D gtG (z) is algebraic over K. Pringsheim's Theorem [T3] guarantees that 
for any er > 1, D g . CT (z) has a dominant real singularity 7^ > 0. 

In particular, for a — 2, we verify directly that 72 is the unique solution 
of minimum modulus of 8%{z) = 1/4, which is strictly smaller than any other 
singularities of 82(2) and satisfies #'(72) 7^ 0. It follows that Tig^iz) is gov- 
erned by the supercritical paradigm |13j . and hence D g ^{z) has the singular 
expansion 

D 3 . 2 (z) = fc;(l-7 2 r (3s - 1/2) (l+o(l)) for z^ 72, (39) 

for some constant k' g . 

For arbitrary but fixed g, we thus find the asymptotics 

[z^M^k.n 3 ^- 1 ^^, (40) 

where 72 ~ 1.9685 as was claimed. 
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